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ABSTRACT 

Accurate Medical diagnosis is not always possible at every medical 
center, especially in the Developing Countries where poor healthcare 
services and lack of advanced diagnostic methods and equipments 
affects procedures of medical diagnosis .Also, physician intuition and 
experience are not always sufficient to achieve high quality medical 
procedures results. Therefore, diagnostic errors and undesirable 
results are reasons for a need for Machine Learning Techniques 
based decision support system, which in turns reduce diagnostic 
errors, increasing the patient safety and save lives. This research 
focuses on this aspect of Medical diagnosis by learning pattern 
through the collected dataset of respiratory diseases such as: 
pneumonia and Covid-19, also consist implementation and test of 
intelligent medical decision support system to assist physicians and 
radiologists can deliver great assistance by improving their decision- 
making ability. In this Research paper, the proposed System use 
Neural network Resnet-50 and transfer learning technique to classify 
these severe diseases and performs evaluation of precision, accuracy, 
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specificity of Decision support system. 
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1, INTRODUCTION 

The major challenge facing the healthcare industry is 
the provision for quality services at affordable costs. 
A quality implies diagnosing patients correctly and 
treating them effectively. Poor clinical decisions can 
lead to disastrous results which is unacceptable. 
Medical diagnosis can be performed in an automatic 
way with the use of computer-based systems or 
algorithms. Such systems are usually called 
diagnostic decision support systems (DDSSs) or 
medical diagnosis systems (MDSs), which fall under 
the more general category of decision support system 
(DSS). The aim of these types of systems is to guide 
the physicians through the systematic differential 
diagnosis process. 


Respiratory Diseases are very serious health problems 
in the life of people. These diseases include chronic 
obstructive pulmonary disease, pneumonia, pleural 
effusion, COVID-19 and other lung diseases. The 
timely diagnosis of various Respiratory diseases 1s 
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very important. There are different types of Lung 
diseases which can be categorized by Accurate 
analysis of X-rays. Chest radiography is an 
economical and easy- to-use medical imaging and 
diagnostic technique. The technique is the most 
Innovative that used diagnostic tool in medical 
practice and has an important role in the diagnosis of 
the lung disease. Many image processing , Machine 
Learning and Deep Learning Approaches have been 
developed for this purpose. 


Deep learning-based techniques has been considered 
to be one of the most emerging & advanced 
techniques in recent years in computer aided systems. 
This allows Operators to visualize the results 
immediately and with great accuracy. Additionally, 
intuition and experience are frequently used by 
physicians to reach the medical verdicts instead of 
knowledge base. Consequently, — unsolicited 
preconceptions and excessive medical cost are 
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produced which may portend the patient’s lives. In 
this paper, It is proposed to build a Neural Network 
based decision support system for heart diseases 
diagnosis. Neural Network is considered to be 
extremely Accurate Systems in image processing as it 
provides accurate assessment of a disease by both 
image acquisitions and image interpretation. 


I. Machine learning approach 

Machine learning is a_ subfield of artificial 
intelligence (AI). The goal of machine learning 
generally is to understand the structure of data and fit 
that data into models that can be understood and 
utilized by people. 


Although machine learning 1s a field within computer 
science, it differs from traditional computational 
approaches. 


Machine learning algorithms instead allow for 
computers to train on data inputs and use statistical 
analysis in order to output values that fall within a 
specific range. Because of this, machine learning 
facilitates computers in building models from sample 
data in order to automate decision-making processes 
based on data inputs. Machine learning has become a 
powerful tool for analyzing medical domains, 
assessing the importance of clinical parameters, and 
extracting medical knowledge for outcomes research. 
ML approaches that result in good overall 
performance and provide medical staff with 
interpretable prognostic information, providing the 
ability to support decisions and to reduce the number 
of medical tests for a reliable prognosis are also 
desirable. A measure of reliability of the diagnosis or 
prognosis is also important, because this would give 
medical staff sufficient confidence to put the new 
approach into practice. 


Il. Introduction To Deep Learning 

Deep learning attempts to imitate how the human 
brain can process light and sound stimuli into vision 
and hearing. A deep learning architecture is inspired 
by biological neural networks and consists of multiple 
layers in an artificial neural network made up of 
hardware and GPUs. 


Deep learning attempts to imitate how the human 
brain can process light and sound stimuli into vision 
and hearing. A deep learning architecture is inspired 
by biological neural networks and consists of multiple 
layers in an artificial neural network made up of 
hardware and GPUs. 


Among the machine learning algorithms that are 
currently being used and developed, deep learning 
absorbs the most data and has been able to beat 
humans in some cognitive tasks. Because of these 
attributes, deep learning has become the approach 


with significant potential in the artificial intelligence. 
Computer vision and speech recognition have both 
realized significant advances from deep learning 
approaches. 


Hil. Neural Network 

Deep learning is a machine learning method inspired 
by the deep structure of a mammal brain. The deep 
structures are characterized by multiple hidden layers 
allowing the abstraction of the different levels of the 
features. 


This learning algorithm is seen as an unsupervised 
single layer greedily training where a deep network is 
trained layer by layer. Because this method became 
more effective, it has been started to be used for 
training many deep networks. One of the most 
powerful deep networks is the convolutional neural 
network that can include multiple hidden layers 
performing convolution and subsampling in order to 
extract low to high levels of features of the input data. 
This network has shown a great efficiency in different 
areas, particularly, in computer vision, biological 
computation, fingerprint enhancement, and so on. 
Basically, this type of networks consists of three 
layers: convolution layers, subsampling or pooling 
layers, and full connection layers. It relies on training 
data to learn and improve their accuracy over time. 
This may use for predictive modeling, adaptive 
control where they can be trained via dataset. 


2. METHODOLOGY 

I. Supervised Learning 

In supervised learning, the computer 1s provided with 
example inputs that are labelled with their desired 
outputs. The purpose of this method is for the 
algorithm to be able to “learn” by comparing its 
actual output with the “taught” outputs to find errors, 
and modify the model accordingly. Supervised 
learning therefore uses patterns to predict label values 
on additional unlabeled data. 





Figure 1. Deep Learning Workflow 


IT. Supervised Learning Workflow 

In supervised learning, the computer 1s provided with 
example inputs that are labelled with their desired 
outputs. The purpose of this method is for the 
algorithm to be able to “learn” by comparing its 
actual output with the “taught” outputs to find errors, 
and modify the model accordingly. Supervised 
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learning therefore uses patterns to predict label values 
on additional unlabeled data. 


Supervised Learning workflow 


Train: Iterate until you find the best model 





[pram enan nem em on een em mm, 


Predict: Integrate trained models into applications 


PREDICTION 


NEW 
DATA 


w) > > Soca 
Figure 2. Supervised Learning Workflow 


II. Transfer Learning 
Classification 
Transfer learning is a very popular approach in 
computer vision related tasks using deep neural 
networks when data resources are scarce. Therefore, 
to launch a new task, we incorporate the pre-trained 
models skilled in solving similar problems. This 
method 1s crucial in medical image processing due to 

the shortage of real samples. 


in Lung Diseases 


In deep neural networks, feature extraction is carried 
out but passing raw data through models specialized 
in other tasks. Here, we can refer to deep learning 
models such as ResNet-50, where the last layer 
information serves as input to a new classifier. 
Transfer learning in deep learning problems can be 
performed using a common approach called pre- 
trained model’s approach. Restructured Model states 
that pre-trained model can produce a starting point for 
another model used in a different task. This involves 
incorporation of the whole model or its parts. 


The adopted model may or may not need to be refined 
on the input-output data for the new task. The third 
option considers selecting one of the available 
models. It is very common that research institutions 
publish their algorithms trained on challenging 
datasets which may fully or partially cover the 
problem stated by a new task. 


IV. ResNet-50 Neural Network 

ResNet, short for Residual Network is a specific type 
of neural network that was introduced in 2015. 
ResNet-50 is a convolutional neural network that is 
50 layers deep. 


ResNet50 is a variant of ResNet model which has 48 
Convolution layers along with 1 MaxPool and 1 
Average Pool layer. It has 3.8 x 109 Floating points 
operations. It is a widely used ResNet model and we 
have explored ResNet50 architecture in depth. 


a Architecture of ResNet50 model 
Input 





Max pooling layer, Convolution layer 


Fully connected layer 


' 
eon sts) 


Figure 3. ResNet-50 Architecture 


The architecture of ResNet50 has 4 stages as shown 
in the diagram below. The network can take the input 
image having height, width as multiples of 32 and 3 
as channel 


width. For the sake of explanation, we will consider 
the input size as 224 x 224 x 3. Every ResNet 
architecture performs the initial convolution and max- 
pooling using 7x7 and 3x3 kernel sizes respectively. 
Afterward, Stage | of the network starts and it has 3 
Residual blocks containing 3 layers each. The size of 
kernels used to perform the convolution operation in 
all 3 layers of the block of stage 1 are 64, 64 and 128 
respectively. The curved arrows refer to the identity 
connection. The dashed connected arrow represents 
that the convolution operation in the Residual Block 
is performed with stride 2, hence, the size of input 
will be reduced to half in terms of height and width 
but the channel width will be doubled. As we 
progress from one stage to another, the channel width 
is doubled and the size of the input is reduced to half. 


For each residual function F, 3 layers are stacked one 
over the other. The three layers are 1x1, 3x3, Ixl 
convolution. The 1x1 convolution layers are 
responsible for reducing and then restoring the 
dimensions. The 3x3 layer is left as a bottleneck with 
smaller input/output dimensions. Finally, the network 
has an Average Pooling layer followed by a fully 
connected layer having 1000 neurons (Image output). 
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V. Dataset Exploration 

RADIOGRAPHY DATABASE 

A team of researchers from Qatar University, Doha, 
Qatar and the University of Dhaka, Bangladesh along 
with their collaborators from Asian Countries in 
collaboration with medical doctors have created a 
database of chest X-ray images for COVID-19 
positive cases along with Normal and Viral 
Pneumonia images. In our current release, there are 
219 COVID-19 positive images, 1341 normal images 
and 1345 viral pneumonia images. We will continue 
to update this database as soon as we have new x-ray 
images for COVID-19 pneumonia patients. 


This developed database of COVID-19 x-ray images 
from the Italian Society of Medical and Interventional 
Radiology (SIRM) COVID-19 DATABASE, Novel 
Corona Virus 2019 Dataset developed by Joseph Paul 
Cohen and Paul Morrison and Lan Dao in GitHub and 
images extracted from 43 different publications. 
References of each image are provided in the 
metadata. Normal and Viral pneumonia images were 
adopted from the Chest X-Ray Images (pneumonia) 
database. 


3. RESULT EVALUATION AND ANALYSIS 
I. Dataset 

We have total 300 sample images, Which Consist In 
order 100 of COVID-19 and 100 of PNEUMONIA 
and 100 of NORMAL Classes Chest X-ray images. 
For better understanding to train and test Deep neural 
network with this Large dataset we have portioned 
this dataset into several parts one for Training and 
second for Testing Purposes. Therefore, percentage- 
wise portioned training and testing dataset Into 3 sets 
D_1,D_2,D_3 gives chance to evaluate the accuracy 
and other parameters of Resnet-50 


Neural Network. 
All samples = 3x2 table 
| Flys OTT 


Dataset Name_ Training set(%) Testing set(%) 








Num_images = 300 
Table 1. Dataset Exploration 


II. Confusion Matrix 

A Confusion Matrix is a table that is often used to 
describe the performance of a classification model on 
a set of test data for which the true values are known. 


| D3 | 80 





To Measure the performance of our algorithm on the 

testing dataset. This matrix also lets us know the 

correctly classified images and miss-classified 

images. By observing the confusion matrix, we can 

Observe the performance of our deep learning Neural 

Network. There are Some Basic Terms, which are 

Useful to Interpret Confusion Matrix. 

> True positives (TP): These are cases in which we 
predicted yes (they have the disease), and they do 
have the disease. 

> True negatives (TN): We predicted no, and they 
don't have the disease. 

> False positives (FP): We predicted yes, but they 
don't actually have the disease. (Also known as a 
"Type I error.") 

> False negatives (FN): We predicted no, but they 
actually do have the disease. (Also known as a 
"Type II error.") 


Confusion Matrix: ResNet50 


Output Class 


PNEUMONIA 





Target Class 


Figure 4. Confusion Matrix 


HI. Composition and Evaluation 
To evaluate and validate the effectiveness of the 
proposed approach, we 


conducted the experiments So many _ times 


respectively. Parameters were 


heavily turned to increase the performance of the 
model. 


> For performance evaluation, we adopted 
Accuracy (ACC), and Sensitivity (SN)metrics 
from the confusion matrix. 


Accuracy(ACC) = cM jal Ed 
n 
fie 
Sensitivity(SN) TP FA 


> Where, TP is the true positive in case of abnormal 
and TN is the true negative in case of normal, 
while FP and FN are the incorrect model 
predictions for abnormal and normal cases, 
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= Sate 2 at class, for the Resnet-50. 
“ & = Accuracy = 94.6667 | Sensitivity = 0.9600 
= eZ Specificity = 0.9500 | Precision = 0.9057 
cal . i, 
During improvement, a Testing set was used to 
estimate the model. Fig. is a graphical Representation 
mr TN of the loss value against epoch. In the figures, ‘Train 
& Pp & 
AFP & FN loss’ indicates training loss, while ‘Test loss” 
indicates validation (testing) loss. 
80% Training and 20% test 60% Training and 40% test 50° Training and 50% test 
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> Loss error curves obtained by various sizes for the training set (blue curves) and testing set (red curves): (a) 
80% Training and 20% test, (b) 60% Training and 40% test, and (c) 50% Training and 50% test. 


IV. Accuracy Graph 

Accuracy is a commonly used classification metric and indicates how well a classification algorithm can 
discriminate the classes in the test set. The accuracy can be defined as the proportion of the predicted correct 
labels to the total number (predicted and actual) of labels. We have Achieved Highest Accuracy During Training 
and Testing phase of the Proposed Resnet-50 Neural Network. 
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The learning curve accuracy and error obtained by Resnet50- Neural network. 
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> Activation of a Linear Rectified Unit: 
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Image Activations From 'activation_2_relu' Layer 


V. Classification Output 

> Series of experiments with different proportion of sample datasets have been conducted to measure the 
effectiveness and accurateness of the proposed system. To ensure the correctness of the knowledge base and 
outputs, Neural Network is trained repeatedly and its parameters are modified where is needed. For Testing 
Phase: Loaded Several Images from the test dataset and classify into Three Different Class Using Trained 
Resnet-50 Neural Network. Results show that accurate outputs, with 94% training accuracy, are resulted 
with the trained samples, while, the system produces acceptable results for non-trained samples, with 98% 
classification accuracy. 


Predclass=COVID,score=0.97003, origClass=COVID Predclass=COVID,score=0.97318, origClass=COVID 


me ie 
. 





Predclass=NOR MAL,score=1, origClass=+NORMAL Predclass=NORMAL,score=1, origClass=NORMAL 
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Predclass=PNEUMONIA,score=0.98156, origClass=PNEUMONIA Predclass=PNEUMONIA,score=0.96443, origClass=PNEUMONIA 





4. CONCLUSION 

The creation of medical diagnosis systems is a 
problem and that has been studied since the early 
ages. Several techniques and technologies have been 
used in this field, including both knowledge 
representation tools and algorithms that perform the 
diagnosis. Most of the approaches are based on the 
creation of expert systems which capture the 
knowledge of a set of medical doctors in order to 
create a clinical decision support system. 


An innovative decision support system which is based 
on Neural Network and Machine Learning & Deep 
Learning based Approach has been implemented and 
tested to classify two main respiratory disease: Covid- 
19, Pneumonia. Furthermore, the system offers a 
promising opportunity to develop an operational 
screening and testing device for heart disease 
diagnosis. Also, the system can deliver great 
assistance for clinicians and radiologists to make 
advanced diagnosis respiratory disease. 


Using Patient Xray dataset, series of experiments 
have been conducted, to examine the performance 
and accuracy of the proposed system. Compared 
results revealed that the system performance and 
accuracy are excellent, with a heart diseases 
classification accuracy of 98%, results show that the 
proposed system delivers higher classification 
accuracy for both respiratory diseases. 


More work is planned to enrich the Neural Network 
and Machine Learning & Deep Learning based 
Approach knowledge base with more disease samples 
and enlarge the system to cover a wide range of 
respiratory diseases with high accuracy. Additional 
Neural Network training techniques can be adopted, 
including linear vector quantization (LVQ), 
perception and dynamic Neural Network. The 
approach has been validated through the case study; it 
is possible to expand the scope of modeled medical 
knowledge. Furthermore, in order to improve decision 


Validation (Test) Output 





= 





support, interactions should be considered between 
the different medications that the patient is on. 
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